Diffusion and Cascading Behavior 
in Random Networks 

Marc Lelarge* 
Abstract 

The spread of new ideas, behaviors or technologies has been extensively studied using 
epidemic models. Here we consider a model of diffusion where the individuals' behavior is 
the result of a strategic choice. We study a simple coordination game with binary choice 
and give a condition for a new action to become widespread in a random network. We also 
analyze the possible equilibria of this game and identify conditions for the coexistence of 
both strategies in large connected sets. Finally we look at how can firms use social networks 
to promote their goals with limited information. Our results differ strongly from the one 
derived with epidemic models and show that connectivity plays an ambiguous role: while it 
allows the diffusion to spread, when the network is highly connected, the diffusion is also 
limited by high-degree nodes which are very stable. 
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1 Introduction 



There is a vast hterature on epidemics on complex networks (see ( Newman . 20031 ) for a review). 
Most of the epidemic models consider a transmission mechanism which is independent of the 
local condition faced by the agents concerned. However, if there is a factor of coordination 
or persuasion involved, relative conside rations tend to be ini portant in understanding whether 
some new belief or behavior is adopted ( Vega-Redondo . 200?! ). To fix ideas, it may be useful to 
think of the diffusion process as modeling the adoption of a new technology. In this case, when 
an agent is confronted with the possibility of adoption, her decision depends on the persuasion 
effort exerted on her by each of her neighbors in the social network. More formally, those 
neighborhood effects can be captured as follows: the probability that an agent adopts the new 
technology when r out of her d neighbors have already adopted can be modeled by a threshold 
function: this probability is one if r/d > q and it is zero otherwise. For 9 = 1/2, this would 
correspond to a local majority rule. More generally, some simple models for the diffusion of a 
new behavior have been proposed in term of a basic underlying model of individual decision- 
making: as individuals make decisions based on the choices of their neighbors, a pa r ticula r 
pattern of behavior ca n begi n to spread across the links of the network ( Vega-Redondo . 2007I ). 
(jEaslev and Kleinberd . l2010l V In this work, we analyze the diffusion in the large population 
limit when the underlying graph is a random network with given vertex degrees. There is 
now a l arge litera-ture on the study o f complex network s across dif ferent discip lines such as 
physics (Newman . 2003 1). mathematics ( Janson et al. . 2000), s ociology ( Wattd . 20021 ). networking 
(jLelarge and Bolotl . 12008) or economics (IVega-Redondd. 120071) . There i s a lso a large literature o n 
local interaction and a doption exter nalities ( Jackson and Yariv . 2007 ). ( Lopez-Pintado . 20081 ). 
(jCxaleotti et all . I2OIOI ). (lYound . lioOfll ) . Similarly to these works, we study a game where players' 
payoffs depend on the actions taken by their neighbors in the network but not on the specific 
identities of these neighbors. Our most general framework (presented in Section [3]) allows to 
deal with threshold games of complements, i.e. a player has an increasing incentive to take an 
action as more neighbors take the action. 

The main contribution of our paper is a model which allows us to study rigorously semi- 
anonymous threshold games of complements with local interactions on a complex network. We 
use a very simple dynamics for the evolution of the play: each time, agents choose the strat- 
egy with best payoff, given the current behavior of their neighbors. Stochastic versions of best 
response dynamics have been st udied extensiv ely as a simple model for the emergence of con- 
ventions (jKandori et al.l . Il993l ) , (lYoungl . Il993l ) . A key finding in this line of work is that local 
interaction may drive the system towards a pa rticular equili bri um in which all players take the 
same action. Most of the literature following (lEUisonl . 1 19931 ). (iBlumd . Il993l ) is concerned with 
stochastic versions of best response dynamics on fixed networks and a simple condition known 
as risk dominance ( Harsanyi and Selten . 19881 ) determines whether an innovation introduced in 
the network will eventually become widespread. In this ca- s e, bo unds on the convergence time 
have been computed (Young, 2000l ) . ( Montanari and Saberi . 2010l ) as a function of the structure 
of the interaction network. In this paper, we focus on p roperties of d eterministic (myopic) best 
response dynamics. In such a framework, as shown in (|Blumel . E995l ). coordination on the risk 
dominant strategy depends on the network structure, the dynamics of the game and the initial 
configuration. In particular, this coordination might fail and equilibria with co-existent strate- 
gies can exist. This was shown in (jBlumd . Il995l ) already in t he simple cas e where the underlying 
network is the two-dimensional lattice. While the results in (jBlumd . 1 19951 ) depend very much on 
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the details of the particular graph considered, ( Morris . 200d ) derives general results that depend 
on local properties of the graphs. 

More formally, agents are nodes of a fixed infinite graph and revise their strategies syn- 
chronously. There is an edge if agents i and j can interact with each other. Each node 
has a choice between two possible behaviors labeled A and B. On each edge there is an 

incentive for i and j to have their behaviors match, which is modeled as the following coordina- 
tion game parametrized by a real number q € (0, 1): if i and j choose A (resp. B), they each 
receive a payoff of q (resp. (1 — q)); if they choose opposite strategies, then they receive a payoff 
of 0. Then the total payoff of a player is the sum of the payoffs with each of her neighbors. 
If the degree of node i is di and Nf is her number of neighbors playing B, then the payoff to 
i from choosing A is q{di — Nf) while the payoff from choosing i? is (1 — q)N^ . Hence the 
best response strategy for agent i is to adopt B if > qdi and A if A^,^ < qdj. A number o f 



qualitative insights can be derived from this simple diffusion model (iBlumj . Hoosl : iMorrisl . \wodl ) . 
Clearly, a network where all nodes play A is an equilibrium of the game as is the state where 
all nodes play B. Consider now a network where all nodes initially play A and a small number 
of nodes are forced to adopt strategy B (constituting the seed). Other nodes in the network 
apply best-response updates, then these nodes will be repeatedly applying the following rule: 
switch to B if enough of your neighbors have already adopted B. There can be a cascading 
sequence of nodes switching to B such that a network- wide equilibrium is reached in the limit. 
In particular, contagion is said to occur if a,ction . B can spread from a finite set of players to the 
whole population. It is shown in dMorrisl . ^mI ) that there is a contagion threshold such that 
contagion occurs if and only if the parameter q is less than the contagion threshold. In particular 
the contagion threshold is always at most 1/2 so that B needs to be the risk -dominant action in 
order to spread. However this condition is not sufficient and (jMorrisl . lioool ) gives a number of 
characterizations of the contagion threshold and also shows that low contagion threshold implies 
the existence of equilibria where both actions are played. 

Most of the results on this model are restricted to deterministic infinite graphs and the 
results in (jMorrid . lioool ') depend on local properties of graphs that need to be checked node-by- 
node. In networks exhibiting heterogeneity among the nodes, this approach will be limited in 
its application. In this work, we analyze the diffusion in the large population limit when the 
underlying graph is a random network G(n, d) with n vertices and where d = (di)" is a given 
degree sequence (see Section 12.11 for a detailed definition) . Although random graphs are not 
considered to be highly realistic models of most real-world networks, they are often used as first 
approximation and are a natural first choice for a sp arse interaction network in the absence of 
any known geometry of the problem ( Jacksonl . l2008l ) . Our analysis yields several insights into 



how the diffusion propagates and we believe that they could be used in real-world situations. 
As an example, we show some implications of our results for the design of optim al advertising 
strategies. Our results extend the previous analysis of global cascades made in (|wattsl . liooi ) 
using a threshold model. They differ grea tly frora the s tudy of standard epidemics models used 
for the analysis of the spread of viruses (iBaileyi . Il975l ) where an entity begin as 'susceptible' 
and may become 'infected' and infectious through contacts with her neighbors with a given 
probability. Already in the simple coordination game presented above, we show that connectivity 
(i.e. the average number of neighbors of a typical agent) plays an ambiguous role: while it allows 
the diffusion to spread, when the network is highly connected, the diffusion is also limited by 
high-degree nodes, which are very stable. These nodes require a lot of their neighbors to switch 
to B in order to play B themselves. 
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In the case of a sparse random network of interacting agents, we compute the contagion 
threshold in Section 12.21 and relate it to the important notion of pivotal players. We also show 
the existence of (continuous and discontinuous) phase transitions. In Section 12.31 we analyze 
the possible equilibria of the game for low values of q. In particular, we give conditions under 
which an equilibrium with coexistence of large (i.e. containing a positive fraction of the total 
population) connected sets of players A and B is possible. In Section 12.41 we also compute the 
minimal size of a seed of new adopters in order to trigger a global cascade in the case where 
these adopters can only be sampled without any information on the graph. We show that this 
minimal size has a non-trivial behavior as a function of the connectivity. In Section [2.51 we give 
a heuristic argument allowing to recover the technical results which gives some intuition behind 
our formulas. Our results also explain why social networks can display a great stability in the 
presence of continual small shocks that are as large as the shocks that ultimately generate a 
global cascade. Cascades can therefore be regarded as a specific m anifestation of the robust yet 
fragile nature of many complex systems ( Carlson and Doyle . 20021 ): a system may appear stable 



for long periods of time and withstand many external shocks (robustness), then suddenly and 
apparently inexplicably exhibit a large scale cascade (fragility). 

Our paper is divided into two parts. The first part is contained in the next section. We 
apply our technical results to the particular case of the coordination game presented above. 
We describe our main findings and provide heuristics and intuitions for them. These results are 
direct consequences of our main theorems stated and proved in the second part of the paper. This 
second part starts with Section [31 where we present in details our most general model of diffusion 
(where initial activation and thresholds can vary among the population and also be random). 
This gen eral model allows to recoyer and extend results in the random graphs literature and is 



used in (jCoupechoux and Lelargd . l201ll ) for the analysis of the impact of clustering. We also 
state our main technical results: Theorem [10] and Theorem [TTJ Their proofs can be found in 
Sections m and [5| respectively. Section [7| contains our concluding remarks. 

Probability asymptotics: in this paper, we consider sequences of (random) graphs and 
asymptotics as the number of vertices n tends to infinity. For notational simplicity we will 
usually not show the dependency on n explicitly. All unspecified limits and other asymptotics 
statement are for n — )• oo. For example, w.h.p. (with high probability) means with probability 
tending to 1 as n — )• oo and — t-^ means c onvergence in proba bility as re — )• cx). Similarly, we 
use Op , J7p and Op in a standard way, see Janson et al. (|2000l ). For example, if X^") = X is a 



parameter of the random graph, X = Op{n) means that ¥{X > en) — > as re — )• oo for every 
e > 0, equivalently X/n — t-^ 0, or for every e > 0, \X\ < ere w.h.p. 



2 Analysis of a simple model of cascades 

In this section, we first present the model for the underlying graph. This is the only part of this 
section required to understand our main technical results in Section [3l The remaining parts of 
this section present and discuss applications and special cases exploring the significance of the 
technical results derived in the sequel. 
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2.1 Graphs: the configuration model 



n 



We consider a set [n] = {1, . . . , n} of agents interacting over a social network. Let d = {d\ ji 
{di)i be a sequence of non-negative integers such that Yl^=i di is even. For notational simpHcity 
we will usually not show the dependency on n explicitly. This sequence is the degree sequence 
of the graph: agent i € [n] has degree di, i.e. has di neighbors. We define a random multigraph 
(allowing for self- loop and n iultiple li r iks) w ith given degree sequence d, denoted by G*{n,d) 
by the configuration model ( Bollobas . 200 ll ): take a set of di half-edges for each vertex i and 
combine the half-edges into pairs by a uniformly random matching of the set of all half-edges. 
Conditioned on the multigraph G*{n, d) being a simple graph, we obtain a un iformly distribu ted 
random graph with the given degree sequence, which we denote by G{n, d) (jjansonl . l2009bl ) . 



We will let n^oo and assume that we are given d 
conditions, see (jMollov and Reedl . llOOd ): 



(dj)" satisfying the following regularity 



Condition 1. For each n, d = (dj"'^)" is a sequence of non-negative integers such that X^iLi "^i 
is even and, for some probability distribution p = (pr)^o independent of n, 



(i) \{i : di = r}\/n ^ pr for every r > as n ^ oo; 
(ii) A := Er>o^Pr ^ (O'Oo)/ 

In words, we assume that the empirical distribution of the degree sequence converges to a 
fixed probability distribution p with a finite mean A. Condition [1] (iii) ensures that p has a finite 
second moment and is technically required. It implies that the empirical mean of the degrees 
converges to A. 

In some cases, we will need the following additional condition which implies that the asymp- 
totic distribution of the degrees p has a finite third moment: 

Condition 2. For each n, d = is a sequence of non-negative integers such that df = 

0{n). 



The results of this work can be a pplied to some othe r random gr aphs models too by con- 
ditioning on the vertex degrees (see d.Ianson and Luczakl . l2007l . ^^)). For example, for the 
Erdos-Renyi graph G{n, A/n) with A € (0, oo), our results will hold with the limiting probability 
distribution p being a Poisson distribution with mean A. 



2.2 Contagion threshold for random networks 



An interesting perspective is to understand how different network structures are more or less 
hospitable to cascades. Going back to the diffusion mod el descr i bed in the Introduction, we see 
that the lower q is, the easiest the diffusion spreads. In (lMorrisl . l200(ih. the contagion threshol d 
of a connected infinite network (called the cascade capacity in ( Easlev and Kleinberg . 2O10l )) 
is defined as the maximum threshold qc at which a finite set of initial adopters can cause a 
complete cascade, i.e. the resulting cascade of adoptions of B eventually causes every node to 
switch from A to B. In this section, we consider a model where the initial adopters are forced 
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to play B forever. In this case, the diffusion is monotone and the number of nodes playing B is 
non-decreasing. We say that this case corresponds to the permanent adoption model: a player 
playing B will never play A again. We will discuss a model where the initial adopters also apply 
best-response update in Section | 



To illustrate the notion of contagion threshold, consider the infinite network depicted on 
Figure [TJ It is easy to see that the contagion threshold for such a network is Qc = \, the same 
as for the 2-dimensional lattice. Note that in the network of Figure [1] the density of nodes with 



degree 3 is | and the density of nodes of degree 4 is 
vertices have degree 4. 



9' 



whereas in the 2-dimensional lattice, all 




Figure 1: An infinite network with contagion threshold ^ and with density of nodes with degree 
3 equals to | and degree 4 equals to 



We now compute the contagion threshold for a sequence of random networks. Since a random 
network is finite and not necessarily connected, we first need to adapt the definition of contagion 
threshold to our context. We start by recalling a b asic result on the size of the largest component 
of a random graph with a given degree sequence (iMolloy and Reedl . Il995l : iJanson and Luczakl . 
20091 ) ■ namely a sufficient condition for the existence of a giant component. 

Proposition. Consider the random graph G(n,d) satisfying Conditions{l\with asymptotic de- 
gree distribution p = (pr)J^o- Z^J^i ^(^ ~ '^)Pr > then the size of the largest component of 
G{n,d) is ^}p{n). In this case, we say that there exists a giant component. 



For a graph G = {V, E) and a parameter g, we consider the largest connected component 
of the induced subgraph in which we keep only vertices of degree strictly less than q~^. We 
call the vertices in this component pivotal players: if only one pivotal player switches from A to 
B then the whole set of pivotal players will eventually switch to B in the permanent adoption 
model. For a player u € 1^, we denote by C{v, q) the final number of players B in the permanent 
adoption model with parameter when the initial state consists of only v playing i?, all other 
players playing A. Informally, we say that C(f , q) is the size of the cascade induced by player v. 

Proposition 3. Consider the random graph G(n, d) satisfying Conditions{l\andl^with asymp- 
totic degree distribution p = (pr)^o such that ^,r>i r{r — 2)pr > 0. We define qc by: 



sup < 



r(r — l)pr > rpr 



(1) 



2<r<q-^ 



Kr 
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Let he the set of pivotal players in G(n, d). 

(i) For q < Qc, there are constants < '^{q,p) < s{q,p) such that w.h.p. lim„ ^-^-^ = "y{q,p) 
and for any v £ p(")^ liminf„ '^^.^''^^ > s{q,p). 

(a) For q > qc, for an uniformly chosen player v, we have C{v,q) = Op{n). The same result 
holds if o{n) players are chosen uniformly at random. 

Note that Proposition [3] is vahd only for graphs with a giant component (for graphs with all 
components Op{n), we have trivially that C{v, q) = Op{n) for all values of q). We can rewrite ([1]) 
as follows: let D be a random variable with distribution p, i.e. ¥{D = r) = pr, then 

qc = sup {q : E [D{D - 1)1{D < q-'^)] > E[D]} . 

We can restate Proposition [3] as follows: let C^"^ be the size of a cascade induced by switching 
a random player from A to B. Proposition [3] implies that for q > qc, we have P(C''"^ > en) — t- 
as n — > oo for every e > 0, whereas for q < qc there exists constants s{q,p) > "y{q,p) > 
depending only on q and the parameters of the graph such that for every e > 0, liminf„ P(C(") > 
(s(g,p) — e)n) > ^{q,p)- Informally, we will say that global cascades (i.e. reaching a positive 
fraction of the population) occur when q < qc and do not occur when q > qc- We call qc defined 
by ([T]) the contagion threshold for the sequence of random ne tworks with degree distribution 
{pr)- This result is in accordan ce with the h euristic result of dWattd . I2OO2I ) (see in particular 



the cascade condition Eq. 5 in (jWattsl . |2002| )). Proposition [3] follows from Theorem 1111 below 
which also gives estimates for the parameters s{q,p) and 'j{q,p) when q < qc- Under additional 
technical conditions, Theorem [TT] shows that lim„ '^^n'^^ ~ ^(QiP) case (i). 

Note that the condition ^j,>i r(r — 2)pr > implies that + P2 < 1 so that for g > 1/3, 
we have, Ylr<q-^ '^(^ ~ ^)Pr — < fPr- Heucc we have the following elementary corollary: 

Corollary 4. For any random graph G{n, d) satisfying Conditions {1\ and with asymptotic 
degree distribution p = (pr)J^o such that '^r>i '^('^ ~ '^)Pr > 0; ''^^ have qc < 1/3. 



Recall that for general graphs, it is shown in ( Morrid . 200d ) that qc < 1/2. Note also that our 



bound in Corollary |3] is tight: for a r-regular network with r > 3 chosen uniformly at random 
(i.e. G{n,d) such that di = r for all i £ [n]) , Prop osition El imp lies that qc = r~^ corresponding 



to the contagion threshold of a r-regular tree, see (|Morrid . l2000l l. For r > 3, a r-regular random 
graph is connected w.h.p, so that for any q < r~^, an initial adopter will cause a complete 
cascade (i.e. reaching all players). In particular, in this case, the only possible equilibria of the 
game are all players playing A or all players playing B. We will come back to the analysis of 
equilibria of the game in Section 12.31 

Given a degree distribution p, Proposition[3]gives the typical value of qdp) under the uniform 
distribution on graphs with the prescribed degree distribution. A question of interest is to find 
among all graphs with a given degree sequence, which ones achieve the lowest or highest value of 
the contagion threshold. Indeed, the contagion threshold of a graph can be seen as a proxy for its 
susceptibility to cascades: the higher the contagion threshold is, the more susceptible the graph 
is to cascading behavior. Clearly if dmax is the maximal degree in the graph, we have qc > 
In the case of regular graphs, we saw that the typical contagion threshold is equal to this lower 
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bound: qdd) = ^. Note however that there are examples of regular graphs in ( Morrisl . 200d ) 
with a strictly higher threshold (see Figure 3 there: a 8-regular graph with contagion threshold 
3/8). Hence among all r-regular graphs, picking a graph at random will give w.h.p. a graph 
which is the least susceptible to cascades among those r-regular graphs. However it is not always 
the case that picking a graph at random among all graphs with a given degree sequence will 
result in the least susceptible one. There are situations where the typical value of the contagion 
threshold exceeds the lowest possible value: this is the case in the example presented above in 
Figured] for which the contagion threshold is | whereas the sequence of random graphs G{n,d) 



with asymptotic deg ree distribution suc h that P3 = | and p4 
i. The recent work ( Blume et al.l . [2OI1I ) has a similar focus. 



has a contagion threshold of 



r 



Figure 2: Left: (?c(A) for Erdos-Renyi random graphs and for power law graphs (dashed curve) 
as a function of the average degree A. Right: qd^) for Erdos-Renyi random graphs as a function 
of A in the range A G [1; 1.2]. 



We now consider some examples. First the case of Erdos-Renyi random graphs G{n,\/n) 
where each of the (2) edges is present with probability A/n for a fixed parameter A. In this 
case, we can apply our results with = for all r > 0, i.e. the Poisson distribution with 

mean A. As shown in Figure [2] (Left) for the case of Erdos-Renyi random graphs G{n, A/n), Qc 
is well below 1/3, indeed we have Qc < 1/4 for any value of A. As shown in Figure [2] (Right), 
we see that qc is a non-decreasing function of A the average degree in the graph for A < 2. 
Clearly on Figure [2] (Left), Qc is a non-increasing function of A, for A > 4. The second curve in 
Figure [2] corresponds to the contagion threshold for a scale- free random network whose degree 
distribution pr = (with ^(7) = '^r~^) is parametrized by the decay parameter 7 > 1. We 
see that in this case we have < 1/9. In other words, in an Erdos-Renyi random graph, in 
order to have a global cascade, the parameter q must be such that any node with no more than 
four neighbors must be able to adopt B even if it has a single adopting neighbor. In the case of 
the scale free random network considered, the parameter q must be much lower and any node 
with no more than nine neighbors must be able to adopt B with a single adopting neighbor. 
This simply reflects the intuitive idea that for widespread diffusion to occur there must be a 
sufficient high frequency of nodes that are certain to propagate the adoption. 

We also observe that in both cases, for q sufficiently low, there are two critical values for 
the parameter A, 1 < Xi{q) < Xs{q) such that a global cascade for a fixed q is only possible 
for A G {Xi{q); Xs{q)). The heuristic reason for these two thresholds is that a cascade can be 
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prematurely stopped at high-degree nodes. For Erdos-Renyi random graphs, when 1 < A < 
Xi{q), there exists a giant component, i.e. a connected component containing a positive fraction 
of the nodes. The high-degree nodes are quite infrequent so that the diffusion should spread 
easily. However, for A close to one, the diffusion does not branch much and progresses along 
a very thin tree, "almost a line", so that its progression is stopped as soon as it encounters a 
high-degree node. Due to the variability of the Poisson distribution, this happens before the 
diffusion becomes too big for A < \i{q)- Nevertheless the condition A > Xi{q) is not sufficient for 
a global cascade. Global diffusion also requires that the network not be too highly connected. 
This is reflected by the existence of the second threshold Xs{q) where a further transition occurs, 
now in the opposite direction. For A > Xs{q), the diffusion will not reach a positive fraction of 
the population. The intuition here is clear: the frequency of high-degree nodes is so large that 
diffusion cannot avoid them and typically stops there since it is unl i kely t hat a high enough 
fraction of their many neighbors eventually adopts. Following (jWattsl . |2002| ) . we say that these 
nodes are locally stable. 




Figure 3: Size s{q, A) of the cascade (in percent of the total population) for Erdos-Renyi random 
graphs as a function of A the average degree for a fixed q = 0.15. The lower curve gives the 
asymptotic fraction of pivotal players 'y{q,X). 



The proof of our Theorem [TT] makes previous intuition rigorous for a more general model of 
diffusion and gives also more insights on the nature of the phase transitions. We describe it now. 
The lower curve in Figure [3] represents the number of pivotal players in an Erdos-Renyi random 
graph as a function of A the average connectivity for q~^ = 6.666...: hence we keep only the 
largest connected component of an Erdos-Renyi random graph where we removed all vertices of 
degree greater than 6. By the same heuristic argument as above, we expect two phase transitions 
for the size of the set of pivotal players. In the proof of Theorem 111^ we show that it is indeed 
the case and moreover that the phase transitions occur at the same values Xi{q) and Xs{q) as can 
be seen on Figure [3] where the normalized size (i.e. fraction) 7(5, A) of the set of pivotal players 
is positive only for A G (Aj(g), Xs{q)). Hence a cascade is possible if and only if there is a 'giant' 
component of pivotal players. Note also that both phase transitions for the pivotal players are 
continuous, in the sense that the function A 1— > j{q, A) is continuous. This is not the case for the 
second phase transition for the normalized size of the cascade given by s{q, A) in Proposition [3 
the function A 1-^ s{q, A) is continuous in Xi{q) but not in Xs{q) as depicted on Figure[3l This has 
important consequences: around Aj(g) the propagation of cascades is limited by the connectivity 
of the network as in standard epidemic models. But around Xs{q), the propagation of cascades 
is not limited by the connectivity but by the high-degree nodes which are locally stable. 
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To better understand this second phase transition, consider a dynamical model where at 
each step a player is chosen at random and switches to B. Once the corresponding cascade is 
triggered, we come back to the initial state where every node play A before going to the next 
step. Then for A less than but very close to \s{q), most cascades die out before spreading very 
far. However, a set of pivotal players still exists, so very rarely a cascade will be triggered by 
such a player in which case the high connectivity of the network ensures that it will be extremely 
large. As A approaches Xs{q) from below, global cascades become larger but increasingly rare 
until they disappear implying a discontinuous phase transition. For low values of A, the cascades 
are infrequent and small. As A increases, their frequencies and sizes also increase until a point 
where the cascade reaches almost all vertices of the giant component of the graph. Then as A 
increases, their sizes remain almost constant but their frequencies are decreasing. 



2.3 Equilibria of the game and coexistence 

We considered so far the permanent adoption model: the only possible transitions are from 
playing A to B. There is another possible model in which the initial adopters playing B also 
apply best-response update. We call this model the non-monotonic model (by opposition to the 
permanent adoption model). In this model, if the dynamic converges, the final state will be 
an equilibrium of the game. An equilibrium of the game is a fixed point of the best response 
dynamics. For example, the states in which all players play A or all players play B are trivial 
equilibria of the game. Note that the permanent adoption model does not necessarily yield to an 
equilibrium of the game as the initial seed does not apply best response dynamics. To illustrate 
the differences between the two models consider the graph of Figured] for a value of g = 1/3: if 
the circled player switches to B, the whole network will eventually switch to B in the permanent 
adoption model, whereas the dynamic for the non-monotonic model will oscillate between the 
state where only the circled player plays B and the state where only his two neighbors of degree 
two play B (agents still revise their strategies synchronously). 



Figure 4: The circled player induces a global cascade in the permanent adoption model but not 
in the non-monotonic model for q = 1/3 (each triangle represents a long line of players). 



Clearly if a player induces a global cascade for the non-monotonic model, it will also induce 
a global cascade in the permanent adoption model. As illustrated by the case of Figure (H the 
converse is not true in general. Hence, a priori, a pivotal player as defined i n pre vious section 
might not induce a global cascade in the non-monotonic model. In (jMorrid . lioool ') . it is shown 
that if one can find a finite set of initial adopters causing a complete cascade for the permanent 
adoption model, it is also possible to find another, possibly larger but still finite, set of initial 
adopters leading to a complete cascade for the non-monotonic model. Hence the contagion 
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threshold as defined in (jMorrid . l200d ^ for infinite graphs is the 



same for both models. In om' 



case, we see that if we switch from A to B two pivotal players who are neighbors, then the 
whole set of pivotal players will eventually switch to B in the non-monotonic model. Hence 
in our analysis on random networks, we also say that the contagion threshold is the same in 
both models. Moreover, both models will have exactly the same dynamics if started with the 
set of pivotal players playing B and all other players playing A. In particular, it shows that 
the dynamic converges and reaches an equilibrium of the game. Hence we have the following 
corollary: 

Corollary 5. Consider the random graph G{n,d) satisfying Conditions[l\andlMwith asymptotic 
degree distribution p = iPr)^Q- For q < q^, there exists w.h.p. an equilibrium of the game in 
which the number of players B is more than s{q,p)n (defined in Proposition {3^ and it can be 
obtained from the trivial equilibrium with all players playing A by switching only two neighboring 
pivotal players. 

Hence for q < qc, the trivial equilibrium all A, is rather 'weak' since two pivotal players 
can induce a global cascade and there are i^p{n) such players so that switching two neighbors 
at random will after a finite number of trials (i.e. not increasing with n) induce such a global 
cascade. We call this equilibrium the pivotal equilibrium. Figure [5] shows the average number 
of trials required. It goes to infinity at both extremes Xi{q) and Xs{q). Moreover, we see that 
for most values of A inside this interval, the average number of trials is less than 2. q > qc, 
then by definition if there are pivotal players, their number must be Op{n). Indeed, in the case 
of r-regular graphs, there are no pivotal players for q > qc = f~^- Hence, it is either impossible 
or very difficult (by sampling) to find a set of players with cardinality bounded in n leading to 
a global cascade since in all cases, their number is Op{n). 



Figure 5: Average number of trials required to trigger a global cascade and reach the pivotal 
equilibrium for Erdos-Renyi random graphs G(n, A/n) as a function of A. 

We now show that equilibria with co- existent strategies exist. Here by co-existent, we mean 
that a connected fraction of players A and B are present in the equilibrium. More formally, we 
call a giant component: a subset of vertices containing a positive fraction (as n tends to infinity) 
of the total size of the graph such that the corresponding induced graph is connected. We ask 
if there exists equilibrium with a giant component of players A and i?? In the case q < qc, for 
values of A close to Xs{q), the global cascade reaches almost all nodes of the giant component 
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leaving only very small of connected component of players A. However for A close to Xi{q), this 
is not the case and co-existence is possible as shown by the following proposition: 

Proposition 6. In an Erdos-Renyi random graph G{n,X/n), for q < Qc, there exists Xdq) G 
[Xi{q), Xs{q)] such that: 

• for X € {Xi{q), Xc{q)), in the pivotal equilibrium, there is coexistence of a giant component 
of players A and a giant component of players B. 

• for X > Xc{q), in the pivotal equilibrium, there is no giant component of players A, although 
there might be a positive fraction of players A. 




Figure 6: Coexistence: the upper (red) curve is the fraction of players in the giant component 
as a function of the average degree A. The lower (green) curve is the fraction of players B in 
the pivotal equilibrium as a function of A (for q = 0.2). The last (brown) curve is the size of the 
giant component of players A in the pivotal equilibrium as a function of A. 

Figure El illustrates this proposition in the case of Erdos-Renyi random graphs G{n, X/n). 
The difference between the upper (red) and lower (green) curve is exactly the fractions of players 
A in the pivotal equilibrium while the brown curve represent the size (in percentage of the total 
population) of the largest connected component of players A in the pivotal equilibrium. This 
curve reaches zero exactly at Xc{q). Hence for A > Xc{q), we see that there is still a positive 
fraction of the population playing A, but the set of players A is divided in small (i.e. of size 
o(n)) connected components, like 'islands' of players A in a 'sea' of players B. Note also that 
for Erdos-Renyi random graphs, the value of Xc{q) is close to Xi{q). Proposition [6] follows from 
the following proposition whose proof is given in Section [6j 

We first introduce some notations: for D a random variable, we define for < x < 1, 
Dx the thinning of D obtained by taking D points and then randomly and independently 
keeping each of them with probability x, i.e. given D = d, ~ Bi((i, x). We define g{z,p) = 
J2sPs J2r>s-[sq\ ^t>sr{z) (corresponding to the function g defined in previous section for a = 
and vr = 1), X{p) = sps and 

C = sup {z < 1, X{p)z'^ = g{z,p)} . 

Proposition 7. Consider the random graph G(n,d) satisfying Conditions{l\andl^with asymp- 
totic degree distribution p = {pr)^Q- Let D be a random variable with distribution p. If q < qc, 



12 



then ^ < 1. Assume moreover that there exists e > such that g{z,p) < for z G — 
There is coexistence of a giant component of players A and a giant component of players B in 
the pivotal equilibrium if 

E [D(^{D^ - 2)l{qD >D- D^)] > 0. 



2.4 Advertising with word of mouth communication 

We consider now scenarios where A ^ [K{q), ^sig)] and the initial set of adopters grows hnearly 
with the total population n. More precisely, consider now a firm advertising to a group of 
consumers, who share product information among themselves: potential buyers are not aware 
of the existence of the product and the firm undertakes costly informative advertising. The 
firm chooses the fraction of individuals who receive advertisements. Individuals are located in a 
social network modeled by a random network G{n, d) with given vertex degrees as in previou s 
section. However co ntrary to most work on viral marketing ( Richardson and Domingoa . 20021 ). 



(|Kempe et all , booi ). we assume that the advertiser has limited knowledge about the network: 



the firm only knows the proportions of individuals having different degrees in the social network. 
One possibility for the firm is to sample individuals randomly and to decide the costly advertising 
for this individual based on her degree (i.e. her number of neighbors). The action of the firm 
is then encoded in a vector a = (ad), where ad represents the fraction of individuals with 
degree d which are directly targeted by the firm. These individuals will constitute the seed (in 
a permanent adoption model) and we call them the early adopters. Note that the case ad = ct 
for all d corresponds to a case where the firm samples individuals uniformly. This might be one 
possibility if it is unable to observe their degrees. In order to optimize its strategy, the firm 
needs to compute the expected payoff of its marketing strategy as a function of a. Our results 
allows to estimate this function in terms of cx and the degree distribution in the social network. 

We assume that a buyer might buy either if she receives a dvertisement from the f irm o r 
if she receives informatio n via word of mouth comm unication ( Ellison and Fudenberg . 19951 ). 



More precisely, following ( Galeotti and Goval . 20091 ). we consider the following general model 



for the diffusion of information: a buyer obtains information as soon as one of her neighbors 
buys the product but she decides to buy the product when Bi(A;, vr) > K{d) where d is her 
number of neighbors (i.e. degree) and k the number of neighbors having the product, Bi(A;, vr) is a 
Binomial random variable with parameters k and vr € [0, 1] and K{d) is a general random variable 
(depending on d). In words, vr is the probability that a particular neighbor does influence the 
possible buyer. This possible buyer does actually buy when the number of influential neighbors 
having bought the product exceeds a threshold K{d). Thus, the thresholds K{d) represent the 
different propensity of nodes to buy the new product when their neighbors do. The fact that 
these are possibly randomly selected is intended to model our lack of knowledge of their values 
and a possibly heterogeneous population. Note that for K{d) = and vr G [0, 1], our model of 
diffusion is exactly a contact process with probability of contagion between neighbors equals to 
vr. This mode l is also called the SI (susceptible-infected) model in ma thematical ep idemiology 
(|Bailevl . E975l l. Also for vr = 1 and K{d) = qd, we recover the model of (jMorrid . l200d l described 



previously where players A are non-buyers and players B are buyers. 

We now give the asymptotic for the final number of buyers for the case vr G [0, 1] and 
K{s) = qs, with q < 1 (the general case with a random threshold is given in Theorem llOp. 
We first need to introduce some notations. For integers s > and < r < s let bgr denote 
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the binomial probabilities bsr{p) '■= P(Bi(s,p) = r) = — pY ^. Given a distribution 

P = (Ps)seN; we define the functions: 

h{z;a,p,7r) := "^{l - as)ps ^ r6sr(l - vr + vrz), 

s r>s—[sq\ 

g{z;a,p,-K) := X{p)z{l — it + ttz) — h{z; a,p,iT), 
hi{z;a,p,TT) := '^{1 - as)ps ^ 6sr(l - vr + vrz), 

s i"'^s—lsq\ 

where \{p) = J2s ^Ps- We define 

z{a,p, vr) := max {z € [0, 1] : g{z; a,p,TT) = 0} . 

We refer to Section [2.51 for an intuition behind the definitions of the functions h,hi,g and z in 
terms of a branching process approximation of the local structure of the graph. 

Proposition 8. Consider the random graph G{n,d) for a sequence (di)^ satisfying Condition 
[H // the strategy of the firm is given by a, then the final number of buyers is given by {1 — 
hi{z, a,p, 7r))n + Op(n) provided z{a,p, vr) = 0, or z{a,p, n) G (0, 1], and further g{z] a,p,Tr) < 
for any z in some interval {z — e,z). 

To illustrate this result, we consider the simple case of Erdos-Renyi random graphs G{n, X/n) 
with TT = 1, ad = a for all d, and q > (/c(A) where qd^) is the contagion threshold for this network 
(defined in previous section). For simplicity, we omit to write explicitly the dependence in vr 
and replace the dependence in p by the parameter A. 

As we will see, for some values of A there is a phase transition in a. For a certain value ac(A), 
we have: if a < ac{\), the size of the diffusion is rather small whereas it becomes very large 
for a > ac{X)- Clearly the advertising firm has a high interest in reaching the 'critical mass' 
of Qc(A)n early adopters. This phase transition is reminiscent of the one described in previous 
section. Recall that we are now in a setting where q > qd^) so that a global cascade triggered by 
a single (or a small number of) individual(s) uniformly sampled is not possible. Hence, the firm 
will have to advertise to a positive fraction of individuals constituting the seed for the diffusion. 
The intuition is the following: if the seed is too small, then each early adopter starts a small 
cascade which is stopped by high-degree nodes. When the seed reaches the critical mass, then 
the cascades 'coalesce' and are able to overcome 'barriers' constituted by high-degree nodes so 
that a large fraction of the nodes in the 'giant component' of the graph adopt. Then increasing 
the size of the seed has very little effect since most of the time, the new early adopters (chosen 
at random by the firm) are already reached by the global diffusion. Our Theorem 1101 makes this 
heuristic rigorous for the general model of diffusion described above (with random thresholds). 

We now give some more insights by exploring different scenarios for different values of q. 
Consider first a case where q < 1/4, then thanks to the results of previous section, we know that 
there exists 1 < Xi{q) < Xsil) such that for any A G {Xi{q), Xs{q)), a global cascade is possible 
with positive probability if only one random player switches to B. In particular, if a fraction 
a of individuals uniformly sampled are playing A, then for any a > 0, such a global cascade 
will occur with high probability. For A > Xs{q), we know that a single player cannot trigger a 
global cascade. More precisely, if players are chosen at random without any information on the 
underlying network, any set of initial adopters with size o{n) cannot trigger a global cascade, as 
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Figure 7: Case q < 1/4. The green (upper) curve is the size of the final set of players B for 
a = ac{X) given by A i-> l — hi{z, adX)) as a function of A. The brown (lower) curve corresponds 
to the minimal size of the seed for a global cascade A i-> adX)- The red curve corresponds to 
the size of a global cascade when a = similar as in Figure [3l 

shown by Theorem [TTJ However the final size of the set of players playing S is a discontinuous 
function of a the size of the initial seed. If ac(A) is defined as the point at which this function is 
discontinuous, we have: for a < ac(A), the final set of players B will be only slightly larger than 
a but if a > Q!c(A) the final set of players B will be very large. Hence we will say that there is a 
global cascade when a > ac(A) and that there is no global cascade when a < ac(A). As shown 
in Figure El for q < 1/4, we have adX) = for A G [Xi{q), Xs{q)] and ac(A) > for A > Xsiq)- 
In Figure [71 we also see that our definition of global cascade when A > Xsil) is consistent with 
our previous definition since the function A i— > 1 — hi{z,ac{X)) giving the size of the final set 
of players B when the seed has normalized size adX) is continuous in A and agrees with the 
previous curve for the size of the final set of players B when A < Xs{q) and with only one early 
adopter. 




l.S 2.0 2.2 2.4 2.6 2.8 3.0 



Figure 8: Case q > 1/4. Minimal size of the seed for a global cascade A i— )• adX) as a function 
of A. 

We now consider the case where q > 1/4. In this case, thanks to the results of previous 
section, we know that for any value of A, a single initial player B (sampled uniformly) cannot 
trigger a global cascade. But our definition of ac(A) still makes sense and we now have ac(A) > 
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for all A. Figure E] gives a plot of the function A i— )• adX). We see that again there are two 
regimes associated with the low/high connectivity of the graph. For low values of A, the function 
A I—)- ac(A) is non-increasing in A. This situation corresponds to the intuition that is correct for 
standard epidemic models according to which an increase in the connectivity makes the diffusion 
easier and hence the size of the critical initial seed will decrease. 




Figure 9: Case q > 1/4, low connectivity regime. Size of the final set of players B, 
a I—)- 1 — hi{z,a) as a function of the size of the initial seed a for different values of 
A = 1.60,1.61,1.62,1.63,1.64. 

Figure [9] shows the size 1 — hi{z,a) of the final set of players B as a function of the size of 
the initial seed a for various small values of A. We see that for the smallest values of A, there 
is no discontinuity for the function a i— )■ 1 — hi{z,a). In this case ac(A) is not defined. We 
also see that there is a natural monotonicity in A: as the connectivity increases, the diffusion 
also increases. However Figure [8] shows that for A > 2, the function A i— > ac(A) becomes non- 
decreasing in A. Hence even if the connectivity increases, in order to trigger a global cascade 
the size of the initial seed has to increase too. The intuition for this phenomenon is now clear: 
in order to trigger a global cascade, the seed has to overcome the local stability of high-degree 
nodes which now dominates the effect of the increase in connectivity. 

Figure [TOl shows the size 1 — hi{z,a) of the final set of players 5 as a function of the size 
of the initial seed a for various value of A. Here we see that connectivity hurts the start of the 
diffusion: for small value of a, an increase in A results in a lower size for the final set of players 
Bl However when a reaches the critical value ac(A), then a global cascade occurs and its size 
is an increasing function of A. In other words, high connectivity inhibits the global cascade but 
once it occurs, it facilitates its spread. 

2.5 Local Mean Field approximation 

In this subsection, we say that a player A is inactive and a player B is active. We describe 
an approximation to the local structure of the graph and present a heuristic argument which 
leads easily to a prediction for the asymptotic probability of being active. This heuristic allows 
us to recover the formulas for the functions h,g,hi. Results from this section are not needed 
for the rigorous proofs given in Sections |4] and [5] and are only presented to give some intuition 
about our technical formulas. This branching process approximation is standard in the random 
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Figure 10: Case q > 1/4, high connectivity regime. Size of the final set of players B, a 
1 — hi(z, a) as a function of the size of the initial seed a for different values of A = 2, 3, 4 (red, 
green, brown respectively). 



graphs literature ()Durrettl . 120071 ) and is related to the generating function approach in physic s 
WattsI (I2OO2I I. It is called Local Mean Field (LMF) in f|Lelarge and Bolotl . liooi : iLelargel . [20091 ^ 



and in the particular case of an epidemics with independent contagion on each edge, the LMF 
approximation was turned into a rigorous argument. For our model of diffusion with threshold, 
this seems unlikely to be straightforward and our proof will not use the LMF approximation. 

The LMF model is characterized by its connectivity distribution p defined in Condition 
^\ We now construct a tree-indexed process. Let T be a Galton- Watson branching process 
(jAthreva and Nev . 1972 ) with a root which has offspring distribution pr and all other nodes 
have offspring distribution p* given by p^-i = ^ foi" all r > 1. Re call that i P*_^ c orresponds 



to the probability that an edge points to a node with degree r, see (jPurrettl . 120071 ). The tree 
T describes the local structure of the graph G (as n tends to infinity): the exploration of the 
successive neighborhoods of a given vertex is approximated by the branching process T as long 
as the exploration is local (typically restricted to a finite neighborhood independent of n). 

We denote by the root of the tree and for a node i, we denote by gen(i) G N the generation 
of i, i.e. the length of the minimal path from to i. Also we denote i — > j if z belongs to the 
children of j, i.e. gen(i) = gen(j) + 1 and j is on the minimal path from to i. For an edge 
with i —7- j, we denote by Ti^j the sub-tree of T with root i obtained by the deletion of 
edge (i,j) from T. 

We now consider the diffusion model described in Section [2.41 (with threshold for a vertex of 
degree d given by K{d) = qd and probability for a neighbor to be influential given by tt E [0, 1]) 
on the tree T, where the initial set of active nodes is given by a vector a = (o"j)jg[„]: if (jj = 1 
then vertex i is in the initial set of active vertices, otherwise cxj = 0. In our model the cjj's 
are independent Bernoulli random variables with parameter a^- where di is the degree of node 
i in the tree. Thanks to the tree structure, it is possible to compute the probability of being 
active recursively as follows: for any node z 7^ 0, let 1^ = 1 if node i is active on the sub-graph 
Ti^j U with initial set of active nodes given by the restriction of a to individuals in Tj_j.j 

and with individual j held fix in the inactive state. Then for any node i 7^ of degree di, i 
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becomes active if the number of active influential children exceeds qdi. Hence, we get 



Yi = l-{l-ai)l\Y,BHYi<qd, 



(2) 



where the Bus are independent Bernoulli random variables with parameter vr. Then the state 
of the root is given by 



In order to compute the distribution of we first solve the Recursive Distributional Equation 
(RDE) associated to the l^'s: thanks to the tree structure, the random variables 1^ in ([2]) are 
i.i.d. and have the same distribution as 5^. Hence their distribution solve the RDE given by 



where for a given d, the random variable (j{d) is Bernoulli with parameter ad, Bi's are inde- 
pendent Bernoulli with parameter vr, D* has distribution p*, Y and the 1^'s are i.i.d. copies 
(with unknown distribution). To solve the RDE (jH, we need to compute only the mean of 
the Bernoulli random variable Y. Hence taking expectation in (j4]) directly gives a fixed-point 
equation for this mean and the following lemma follows (its proof is deferred to Appendix lA.ip : 

Lemma 9. The solutions of the RDE ^ are Bernoulli random variable with mean x, where 
X is any solution of the fixed point equation: A(l — x){l — xtt) = h{l — x;a,p,TT). For any 
such solution, the random variable X0 defined in ^ is a Bernoulli random variable with mean 
1 — hi{l — x; a,p, tt). 

By the change of variable z = 1 — x, we see that Lemma [9] is consistent with Proposition [8] 
and allow to recover the functions h, g, hi. However note that, the RDE Q has in general several 
solutions one being the trivial one with x = 1 corresponding to the trivial equilibrium where 
every node are active. The fact that this RDE has several solutions is a major difficulty in order 
to turn this LMF approximation into a rigorous argument. Finally, the crucial point allowing 
previous computation is the fact that in recursion ([2]) the Yi can be computed "bottom-up" so 
that the 1^'s of a given generation (from the root) are independent. The l^'s in ([3|) encode the 
information that i is activated by a node in the subtree of T "below" i (and not by the root). 
If one considers a node in the original graph and runs a directed contagion model on a local 
neighborhood of this node where only 'directed' contagion toward this node are allowed, then 
the state of the graph seen from this node is well approximated by the l^'s. 

3 General model and main results 

We first present the model for the diffusion process and then our main results for the spread of 
the diffusion. 




(3) 



Y £ 1 - (1 - a{D* + 1))1 B,Yi < q{D* + 1) 




(4) 
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3.1 Diffusion: percolated threshold model 



In this section, we describe the diffusion for any given finite graph G with vertex set [n]. We 
still denote by di the degree of node i € [n]. From now on, a vertex i is either active or inactive. 
In our model, the initial set of active nodes S (the seed) will remain active during the whole 
process of the diffusion (results for the non-monotonic model are derived in Section [6]). We 
will consider single activation where the seed contains only one vertex and degree based 
activation where the vertex i is in the seed with probability ad^, where di is the degree of the 
vertex independently of the others. In other words, each node i draws independently of each 
other a Bernoulli random variable cjj with parameter and is considered as initially active 
if cjj = 1 and not initially active otherwise. In the case of degree based activation, we denote 
= {ctd) the parameters of this activation. In particular, ii Od = a for all d, then a fraction a 
chosen uniformly at random among the population is activated before the diffusion takes place. 

Symmetric threshold model: We first present the sym metric threshold model which 



generalizes the bootstrap percolation (jBalogh and Pittell . 120071 ): a node becomes active when 



a certain threshold fraction of neighbors are already active. We allow the threshold fraction 
to be a random variable with distribution depending on the degree of the node and such that 
thresholds are independent among nodes. Formally, we define for each node i a sequence of 
random variables in N denoted by (^j(d))^i- The threshold associated to node i is ki = Ki{di) 
where di is the degree of node i. We assume that for any two vertices i and j, the sequences 
{Ki{d))'^^ and {Kj{d))'^^ are independent and have the same law as a sequence denoted by 
{K{d))X^^. For < £ < s, we denote tsi = F{K{s) = £) the probab i lity d istribution of the 
threshold for a node of degree s. For example in the model of (jMorrid . I2OO0I ) described in the 



introduction, we take K{d) = [qd\ so that tgi = l{[qs\ = £)■ We will use the notation k = {ki)^ 
and t = (tse)o<e<s denotes the distribution of thresholds. 

Now the progressive dynamic of the diffusion on the finite graph G operates as follows: some 
set of nodes S starts out being active; all other nodes are inactive. Time operates in discrete 
steps t = 1, 2, 3, . . . . At a given time t, any inactive node i becomes active if its number of active 
neighbors is at least Ki{di) + 1. This in turn may cause other nodes to become active. It is 
easy to see that the final set of active nodes (after n time steps if the network is of size n) only 
depends on the initial set S (and not on the order of the activations) and can be obtained as 
follows: set Xi = \{i ^ S) for all i. Then as long as there exists i such that ^j^iXj > Ki{di), 
set Xi = 1, where j ~ i means that i and j share an edge in G. When this algorithm finishes, 
the final state of node i is represented by Xj: Xi = 1 if node i is active and Xi = otherwise. 

Note that we allow for the possibility di = Ki{di) in which case, node i is never activated 
unless it belongs to the initial set S. Note also that the condition Ki{di) > is actually not 
restrictive. If we wish to consider a model where Ki{di) < is possible, we just need to modify 
the initial seed S so as to put node i in if Ki{di) < 0. Hence this model is equivalent to ours 
if we increase a^^ accordingly. 

Percolated threshold model: this model depends on a parameter vr G [0, 1] and a dis- 
tribution of random thresholds {K{d))deN given by t = {tse)e<s as described above. Given any 
graph G and initial set S, we now proceed in two phases. 

• bond percolation: randomly delete each edge with probability 1 — vr independently of all 
other edges. We denote the resulting random graph by G-^; 
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apply the symmetric threshold model with thresholds K{d): set = l(i € 5) and then 
as long as there is i such that X^.-^ ^Xj > Ki{di), set Xi = 1, where j i means that i 



and j share an edge in G^^ and di is the degree of node i in the original graph G. 

Clearly if vr = 1, this is exactly the symmetr ic threshold model. If in a ddition K{d) = k, then 
this model is known as bootstrap percolation ( Balogh and Pitte] . 20071 ). On the other hand if 



TT G (0, 1) and K(d) = for any d, then this is the contact process with probability of activation 
vr on each edge. Note that the percolated threshold model is not equivalent to the symmetric 
threshold model on the (bond) percolated graph since threshold depends on the degree in the 
original graph (and not in the percolated graph). 



3.2 Diffusion with degree based activation 

Recall that for integers s > and < r < s, hgr denotes the binomial probabilities bsr{p) '■= 

For a graph G, let v{G) and e(G) denote the numbers of vertices and edges in G respectively. 
In this section, we assume that the followings are given: a sequence of independent thresholds k 
drawn according to a distribution t, a parameter tt G [0,1] and an activation set drawn according 
to the distribution a. The subgraph of G{n,d) induced by the activated (resp. inactive) nodes 
at the end of the diffusion is denoted by H (resp. /). For r < s, we denote by Vgril) the number 
of vertices in / with degree s in G and r in /, i.e. the number of vertices with degree s in G 
which are not activated and with r neighbors which are not activated either. We denote by 
Vs{H) the number of activated vertices of degree s in G (and with possibly lower degree in H). 

We define the functions: 

h{z;p,t,a,TT) := ^(1 - as)ps ^ ^ rbsril - tt + nz), 

s e<s r>s-e 

g{z;p,t, a,-K) := X{p)z{l — n + nz) — h{z;p,t, a,-K), 

hi{z;p,t,a,TT) := ^(1 - as)ps ^ ts£ ^ 6sr(l - tt + vrz), 

s e<s r>s-l 

where A(p) = sps- We define 

z = z(p, t, Q;, vr) := max {z G [0, 1] : g(z\ p, t, ct, vr) = 0} . (5) 
We refer to Appendix I A. 2 1 for a justification of the use of the max in ([5]). 

Theorem 10. Consider the random graph G{n, d) (or G*{n, d) ) for a sequence d = (di)" satis- 
fying ConditionUl We are given with a sequence of independent thresholds k drawn according to 
a distribution t, a parameter vr G [0, 1] and an activation set drawn according to the distribution 
OL. For the percolated threshold diffusion on the graph G{n, d), we have: if z = 0, or if z ^ (0, 1], 
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and further g{z; a, t, p, tt) < for any z in some interval {z — e,z), then 
v{H)/n A- I — hi{z;p,t,a,Tr), 

Vsr{I)/n A ^ {I - as)Vstslhsr{z)hs-r,i{'^ - T^), 



i+£>s—r 



Vs{H)/n ps - (1 - as)pstsibsk{'^ - tt + 



TTZ) 

k>s-l 

e{I)/n 4 (^l(vr/l)^^^-^^-^ + l(7r = l)0/i(5;p,t,a,7r). 

If we condition the induced graph I* of inactive nodes in G*{n, d) on its degree sequence and 
let n^* he the number of its vertices, then I* has the distribution ofG*{n^*,d^ ). 

The proof of this theorem is given in Section 31 Note that Proposition [8] follows easily. 
3.3 Diffusion with a single activation 

In this section, we look at the diffusion with one (or a small number o(n) of) initial active 
node(s) in G(n, d). 

For u S let C(n) (resp. be the subgraph induced by the final active 

nodes with initial active node u (resp. initial active nodes l,...,k). We also define I{u) as 
the subgraph induced by the inactive nodes with initial active node u. The set of vertices of 
C{u) and I{u) is a partition of the vertices of the original graph. To ease notation, we denote 
h{z) = h{z;p, t, 0, vr), g{z) = g{z; p, t, 0, vr) and hi{z) = hi{z; p, t, 0, vr). We define 

e:=sup{ze [0,1) : <7(^)=0}. (6) 

We also define 

g{z) = (1 - TT + TTz) ^Az - SPsjl - tso) - SPstsojl - TT + TTz)^'^^ , 

^l(^) = y^Ps^so(l - TT + TTZY + y^Ps(l - tso), 
s s 

and ^ = sup {z G [0, 1) : g{z) = 0}. 

We call the following condition the cascade condition: 

Tr"^r{r -l)prtro>'^rpr, (7) 

r r 

which can be rewritten as TrE,[D(D — 1)1{K(D) = 0)] > E[D] where D is a random variable 
with distribution p. 

We denote by P = P{n, d, k) the largest connected component of the graph G{n, d, k) on 
which we apply a bond percolation with parameter vr (i.e. we remove each edge independently 
with probability 1 — vr) and then apply a site percolation by removing all vertices with ki > 1. 
The vertices of the connected graph P are called pivotal vertices: for any u € P, we have 
P C C{u). 
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Theorem 11. Consider the random graph G{n,d) (or G*{n,d)) for a sequence d = {d.i)f 
satisfying Conditions {1\ andl^ We are given with a sequence of independent thresholds k drawn 
according to a distribution t and a parameter vr € [0, 1] . 



(i) If the cascade condition ^ is satisfied, then 

viP) 



lim 



n 



Moreover, for any u € P, we have w.h.p. 



lim inf 



viCiu)) 



lim inf 



viDuepCiu)) 



> 1-/11 (0 >o, 



n n n 
where ^ is defined by Moreover if ^ = or ^ is such that there exists e > with 
g{z) < for z G {£, ~ ^yO> then we have for any u G P: 



v{C{u))/n 
Vsr{^{u))/n 

Vs{C{u))/n 
e{I{u))/n 



Pstsibsr{Obs-r,iil " Tt), 



i+i>s—r 
Ps 



^ Pstsibski'^ - TT + TT^), 



k>s-i 

l(vr/l) 



— + l(7^ = l)- 
2(l-7^+7rO 2 



HO- 



If we condition the induced graph I*{u) of inactive nodes inG*{n,d) on its degree sequence 
dp ^"^ and let n-^ he the number of its vertices, then I*{u) has the distribution of 
G*(n^*("),d^'(")). 

(a) IfTTY,^.r{r - l)rrtro < Y.r'^Pr, then for any j = o{n), v{C{l, . . . J)) = Op{n). 

A proof of this theorem is given in Section \E\ We end this section with some remarks: if 
the distribution of K{d) does not depend on d, then the cascade condition becomes with D a 
random variable with distribution p: 

In particular, K = (i.e. the diffusion is a simple exploration process), then we find the well- 
known condition for the existence of a 'giant co mponent'. Thi s corresponds to existing results in 
the literature see in particu lar Theorem 3.9 in ( Janson . 2009al ) which extend the standard result 
of (jMollov and Reedl . Il995l ). More generally, in the case vr G [0, 1] and K = (corresponding to 
the contact process), a simple computation shows that 

h{z; a.,p,-K) = {1 — a){l — IT + Trz)(j)'jj{l — TT + TTz) 
hi{z;a,p,TT) = (1 - a)<?!)z)(l - vr + vrz), 

where (poix) = E[x^] is the generating function of the asymptotic degree distribution. Applying 
Theorems [10] and [11] allow to obtain results for the con tact process. Simi l arly, the bootstrap 
percolation has been studied in r andom r egula r graphs ([Balogh and Pittel 120071 ) and random 
graphs with given vertex degrees (|Aminil . [2010l ). The bootstrap percolation corresponds to the 
particular case of the percolated threshold model with tt = 1 and K{d) = 9 > and our 
Theorems 1101 and 1111 allow to recover results for the size of the diffusion. Finally, the case where 
K{d) = qd and vr = 1 implies directly Proposition [3] 
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4 Proof of Theorem ^0 



4.1 Sketch of the proof 

It is well-known that it is often simpler to study the random multigraph G*{n,d) with given 
vertex sequence d = (dj)" defined in Section [2?T1 We consider asymptotics as the number of 
vertices tends to infinity and thus assume throughout the paper that we are given, for each n, a 
sequence d=(dj"))^ with ^.dj") eve n. We may obt ain G{n,d) by conditioning the multigraph 
G* (n, d) on being a simple graph. By ( Janson . 2009bl ). we know that the condition J2i df = 0{n) 



implies liminf P(G*(n, d) is simple) > 0. In this case, many results transfer immediately from 
G*{n, d) to G{n, d), for example, every result of the type ^{£n) for some events En, and thus 
every result saying that some parameter converges in probability to some non-random value. 
This includes every results in the present paper. Henceforth, we will in this paper study the 
random multigraph G*{n, d) and in a last step (left to the reader) transfer the results to G{n, d) 
by conditioning. 

We run the dynamic of the diffusion of Section 13.11 on a gene ral graph G*(n,d) in orde r 
to compute the final size of the diffusion in a similar way as in ( Janson and Luczak . 200?! ). 



The main point here consists in coupling the construction of the graph with the dynamic of 
the diffusion. The main difference with previous analysis consists in adding to each vertex a 
threshold with distribution t independently form the graph G*{n, d) as described in Section [3. II 
The proof of Theorem [10] follows then easily see Sect ion 14.31 In order to prove Theorem W\\ we 



use the same idea of coupling (in a similar spirit as in ( Janson and Luczak . 20091 ) for the analysis 
of the giant component) but we have to deal with an additional difficulty due to the following 
lack of symmetry: if C{u) is the final set of the diffusion with only u as initial active node, then 
for any v G C(n), we do not have in general C{u) = C{v). We take care of this difficulty in 
Section m In the next section, we present a preliminary lemma that will be used in the proofs. 



4.2 A Lemma for death processes 

A pure death process with rate 1 is a process that starts with some number of balls whose 
lifetime are i.i.d. rate 1 exponentials. Each time a ball dies, it is removed. Now consider n bins 
with independent rate 1 death processes. To each bin, we attach a couple {di,ki) where di is 
the number of balls at time in the bin and ki = Ki[di) is the threshold corresponding to the 
bin. We now modify the death process as follows: all balls are initially white. For any living 
ball, when it dies, with probability 1 — vr and independently of everything else, we color it green 
(an leave it in he bin) , otherwise we remove it from the bin. Let W^""^ (t) and Gj^^ (t) denote the 
number of white and green balls respectively in bin j at time t, where j = 1, . . . , n and t > 0. 

Let U^s^ii{t) be the number of bins that have s balls at time and r white balls, i green balls 

at time t and threshold £, i.e. U^J^l^{t) = \{j G [n], W^''\t) = r, Gf\t) = i, dj = s, kj = 
In what follows we suppress the superscripts to l ighten the notation. The following lemma is an 
extension of Lemma 4.4 in ( Janson and Luczakl . 200?! ): 



Lemma 12. Consider the n independent processes defined above and assume that the sequence 
d = {di)1 satisfies Condition [1\ where p = {pr)'^o '-'^'^ ^ defective probability distribution: 
'^j.P^' — ^- assume that the thresholds {(-?^i(c?))^i}ig[n] o,re independent in i and have 
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distribution t = (t. 



se)o<e<i 



as described in Section [J7J[ Then, with the above notation, as n ^ oo. 



,,,(l-7r)| 4 0. 



(8) 



e<s r=0 1=0 



Proof. Let n^i = \{i : di = s,ki = i}\. In particular J2r,iUsriA^) = UssO/iO) = ngi. First 
fix integers s,i and j with < j < s. Consider the rigi bins that start with s baUs and with 
threshold i. For k = 1, . . . ,ns£, let be the time the jth. bah is removed or recolored in the 
kth such bin. Then \{k : < t}\ = X^*=o Si=o Moreover, for the kth. bin, we have 

^ t) = J2t=o bfi r{e~^). Multiplyin g by ngi/n and using Glivenko-Cantelli theorem (see e.g. 
Proposition 4.24 in (jKallenbergj . |2002| )). we have 



sup 

t>0 



^ S-] s-r ^ S-] 



0. 



r=0 1=0 r=0 

Since Usi/n — >^ Pstsi (by Condition [TJi) and the law of large numbers), we see that 



sup 



^ s—r 

- Usri,i{t) - Pstsibsr{e~ 



j=0 







The law of Gk{t) given dk = s and Wk{t) = r < s is a Binomial distribution with parameters 
s — r and 1 — vr. Hence by the law of large numbers, we have 



sup 



Usri,e{t) 



n 



Pstsibsr{e *)6s_r.,j(l - vr) 



0. 



Hence each term in ([8|) tends to in probability. The same holds for any finite partial 
sum. Let e > and S be such that X^^s' < e. By Condition [TJiii), we have Yls i^'^st/n = 
"Yjs ds/n — >■ A = Yls ^Ps- Hence, also 'Yl,s>s ^'^sil'n — )• ^Y^=s ^ ^- ^° ^^^^ sufficiently 
large n, we get Yls>s susi/n < e and 



^'^V^^^^^^^^'r\UsriA'^)h - Pstsibsrie *)5s-r,j(l -vr)| 

s>S £ r=0 j=0 
s s—r 

< sup r {UsriA'>^)/'^+Ps'tslbsr{e^^)bs-r,i{'^ " ^)) 

/>n ^ — — — — 
- s>S i r=0 i=0 

< ^ ^ sin^iln + pst^^) < 2e, 

and the lemma follows. 



s>S e 



□ 



4.3 Proof of the diffusion spread 



Our proof of Theorem [TD] is an adaptation of the coupling argument in (|Janson and Luczakl . 



20071 ). We start by analyzing the symmetric threshold model. We can view the algorithm of 



Section 13.11 as follows: start with the graph G and remove vertices from S, i.e. the initially 
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active nodes. As a result, if vertex i has not been removed, its degree has been lowered. We 
denote by the degree of i in the evolving graph. Then iteratively remove vertices i such that 
df < di — ki. All removed vertices at the end of this procedure are active and all vertices left are 
inactive. It is easily seen that we obtain the same result by removing edges where one endpoint 
satisfy df < di — ki, until no such edge remains, and finally removing all isolated vertices, which 
correspond to active nodes. 

Regard each edge as consisting of two half-edges, each half-edge having one endpoint. We 
introduce types of vertices. We set the type of vertices in the seed S to B. Say that a vertex (not 
in S) is of type A if df > di — ki and of type B otherwise. In particular at the beginning of the 
process, all vertices not in S are of type A since df = di and all vertices in S are by definition 
of type B. As the algorithm evolves, df decreases so that some type A vertices become of type 
B during the execution of the algorithm. Similarly, say that a half-edge is of type A or B when 
its endpoint is. As long as there is any half-edge of type B, choose one such half-edge uniformly 
at random and remove the edge it belongs to. This may change the other endpoint from A to 
B (by decreasing d^) and thus create new half-edges of type B. When there are no half-edges 
of type B left, we stop. Then the final set of active nodes is the set of vertices of type B (which 
are all isolated). 

As in ( Janson and Luczak . 20071 ). we regard vertices as bins and half-edges as balls. At 



each step, we remove first one random ball from the set of balls in i?-bins and then a random 
ball without restriction. We stop when there are no non-empty S-bins. We thus alternately 
remove a random i3-ball and a random ball. We may just as well say that we first remove a 
random B-haW. We then remove balls in pairs, first a random ball and then a random B-hall, 
and stop with the random ball, leaving no B-ball to remove. We change the description a little 
by introducing colors. Initially all balls are white, and we begin again by removing one random 
i3-ball. Subsequently, in each deletion step we first remove a random white ball and then recolor 
a random white i?-ball red; this is repeated until no more white S-balls remain. 

We now run this deletion process in continuous time such that, if there are j white balls 
remaining, then we wait an exponential time with mean 1/j until the next pair of deletions. In 
other words, we make deletions at rate j. This means that each white ball is deleted with rate 
1 and that, when we delete a white ball, we also color a random white S-ball red. Let A{t) and 
B(t) denote the numbers of white A-balls and white S-balls at time t, respectively, and Ai{t) 
denotes the number of A-bins at time t. Since red balls are ignored, we may make a final change 
of rules, and say that all balls are removed at rate 1 and that, when a white ball is removed, a 
random white S-ball is colored red; we stop when we should recolor a white i?-ball but there is 
no such ball. 

Let r be the stopping time of this process. First consider the white balls only. There are no 
white S-balls left at r, so B(t) has reached zero. However, let us consider the last deletion and 
recoloring step as completed by redefining B{t) = —1; we then see that r is characterized by 
B{t) = —1 and B{t) > for < t < r. Moreover, the ^-balls left at r (which are all white) are 
exactly the half-edges in the induced subgraph / of inactive nodes. Hence, the number of edges 
in this subgraph is ^A{t), while the number of nodes not activated is Ai{t). 

If we consider only the total number A{t) + B{t) of white balls in the bins, ignoring the 
types, the process is as follows: each ball dies at rate 1 and upon its death another ball is also 
sacrificed. The process A{t) + B{t) is a death process with rate 2 (up to time r). Consequently, 
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by Lemma 4.3 of ( Janson and Luczak . 200?] ) (or Lemma [T^ above) . we have 

Op(n), 



sup U(i) +B{t) -nXe 

t<T 



-2t\ 



(9) 



since Condition [T] (iii) implies r\{i : = r}|/n — )• A . 

Now if we consider the final version of the process restricted to ^d-bins, it corresponds exactly 
to the death process studied in Section W?2\ above with tt = 1. We need only to compute the initial 
condition for this process. For a degree based activation, each vertex of degree s is activated (i.e. 
the corresponding bin becomes a B-hm) with probability a^. Hence by the law of large numbers, 
the number of yl-bins with initially s balls and threshold £ is Ugi = (1 — as)pstsen + Op{n). With 
the notation of Lemma [T2l we have 



A{t) 



^ rUsro,i{t), and, Ai{t) = ^ UsrG/{t), 



s>l, r>3 



s>l, r>s-l 



with the defective probability distribution (1 — as)ps- Hence by Lemma [T2] we get (recall that 
TT = 1 here): 



sup 

t<T 



A{t) 



n 



s>l,r>s-£ 



r(l - as)PstsfhsT{e 



It is then easy to finish the proof as in ( Janson and Luczak . 200?! ) for this model (the complete 
argument is presented below for the more general percolated threshold model). In particular, it 
ends the proof of Theorem [10] for the case vr = 1 . 

We now consider the percolated threshold model with vr < 1. We modify the process as 
follows: for any white j4-ball when it dies, with probability 1 — tt, we color it green instead of 
removing it. A bin is of type A \i r + i > s — where r is the number of withe balls in the bin, 
i the number of green balls (which did not transmit the diffusion) and s and £ are the initial 
degree and threshold. Let A{t) be the number of white A-balls. By Lemma [T2| we now have: 



sup 

t<T 



sup 

t<T 



A{t) 



n 



n 



'^(l - as)Psts£bsrie ^)bs-r,iil - vr) 

s,r+i>s—£ 

- ^ (1 - as)psis£&sr(e~*)6s_r.,j(l - vr) 

s,r+i>s—i 



0, 



0. 



In particular, we have thanks to Lemma [Til (in Appendix lA.lj) for t < t, 



A{t)/n 



1 — vr + vre 



h{e *;p,t,a,Tr) + Op{n), Ai{t)/n = hi{e *;p,t,cx,7T) + Op{n). (10) 



By looking at white balls (without taking types in consideration), we see that Equation ([9]) is 
still valid. Hence, we have 



sup 

t<T 



B{t) _ e-* 
n 1 — vr + vre' 



Ztdie *;p,t,a,vr) 



0. 



(11) 
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For simplicity we write g{z) for g(z;p,t,cx,TT). Assume now that ti > is a constant 
independent of n with ti < — \nz so that z < I and g{l) > 0. Hence, we have g{z) > for 
z £ {z, 1] and thus g{e~*) > for t < ti. We can find some c > such that g{e~^) > c for t < ti. 
But 5(r) = -1, so if r < then j^^^^gie'^) - B{T)/n > c ^_^~l\^t, and from we 
have P(r < ti) — 0. In case i = 0, we may take any finite ti and find r — oo and pO]) with 
t ^ oo, yields that 

lim A{t) = Op{n), lim Ai{t) = nhi{0;p,t,a,7r) + Op{n). 

t— i-oo t— s-oo 

In case z > 0, by the hypothesis we can find t2 £ (— In -2, — ln(z — e)) such that g{e~^^) = —c < 0. 
If T > t2, then B{t2) > and thus B{t2)/n - ^-^^^^(e-*^) > cj^^^^. Hence by (HH), 
we have P(r > — > 0. Since we can choose ti and ^2 arbitrarily close to — Inz, we have 
T — — In z and (fTO]l with t = r yields that 

TIZ 

= ■ -h{z;p,t,a,Tr) + Op(n), ^i(r) = nhi{z;p,t,a,7r) +Op{n). 

1 — TT + vrz 

Note that v{H) = n — ^i(t) and e(/) = |^(r). Hence we proved Theorem [TD] for v{H)/n and 
e{I)/n. The results for Vsr{I)/n and Vs{H)/n follows from the same argument, once we note 
that 

Vsr{I)= ^ Usri/{T) Wld, Vs{H) = \{j : dj = s}\- ^ t/sri,^^)- 

r+j>s— £ r+i>s— £ 

Finally, the statement concerning the distribution of the induced subgraph /* follows from the 
fact that this subgraph has not been explored when previous algorithm stops. 



5 Proof of Theorem [TT] 

We start this section with some simple calculations. We define for z G [0, 1], 
o-{z) = ^^Ps'^tsi rbsriz), 

s e<s r>s-l 

h{z) = a(l — TT + vrz), 

hl{z) = '^Ps'^tsi ^ bsr{l - TT + TTZ), 
s l<s r>s-l 

h2{z) = hiz) if vr < 1 and h2{z) = h(z) otherwise, 

1 — vr + vrz 

g{z) = Xz{l — TT + TTz) — h{z), 

f{z) = g{z) if vr < 1 and f{z) = g{z) otherwise. 

1 — vr + vrz 

For s > 1 and £ > 1, we have 

^ rbsriz) = s'^z'-^+ r{r-sz)(^]z''-Hl-z) 

r>s-e s-l>r>s-e ^ ^ 

= s^z"*"^ - y r{sz-r)——^ — -bs^ir{z), 
^-^ z(s — r) 

s-l>r>s-e ^ ' 



s— r— 1 



27 



so that for z G [0,1], we have ^ X]r>s-^ ^^sr(^) 
differentiable on [0, 1] and we have 



< + s^. 



Hence by Condition O a is 



s,£ \ s-l>r>s-l ^ ^ J 

In particular, we have 

a'(l) = Y.pstse{s^ -lii>l)s{s-l)) 

s,£ 

= y^^pstsos{s - 1) + A, 

s 

SO that we have g'{l) = A(l + vr) - 7ra'(l). Note also that /(I) = g{l) = and /'(I) = 

Consider now the case (ii) where irJ^sPstsosis — 1) < A, so that g'{l) = /'(I) > 0. The 
proof for an upper bound on n~^u(C(l, . . . , j)) follows easily from Theorem [TOl Take a parameter 
a = (arf)rfgN with = a > for all d. Clearly the final set of active nodes H{a) will be greater 
than for any seed with size Op{n). Now when a goes to zero, the fact that /'(I) < ensures that 
z{a) ^ 1 in Theorem 1101 so that \iina^Qlimn~^oov{H{a))/n = 0. Hence point (ii) in Theorem 
[ri] follows. 

We now concentrate on the case where the cascade condition holds. In particular we have 
^'(1) < so that ^ defined in ([6]) by ^ = sup{z € [0, l),g{z) = 0} is strictly less than one and 
we have 



f{z)>0, VzG (e,l) 



(12) 



Also as soon as there exists e > such that g{z) < for z G (^ — e,^), we can use the same 
argument as above. Since, we have in this case z{a) — > ^ as a ^ 0, it gives an upper bound that 
matches with the st atement (i) of Theorem llli In order to prove a lower bound, we follow the 
general approach of ( jjanson and Luczak . 20091 ). We modify the algorithm defined in Section [4.31 
as follows: the initial set S now contains only one vertex, say vi. When there is no half-edge 
(or ball) of type B, we say that we make an exception and we select a vertex (or a bin) of type 
A uniformly at random among all vertices of type A. We declare its white half-edges of type B 
and remove its other half-edges, i.e. remove the green balls contained in the corresponding bin 
if there are any. Exceptions are done instantaneously. 

For any set of nodes vi, . . . ,Vk, let C{vi, . . . , Vk) be the subgraph induced by the final active 
nodes with initial active nodes vi,...,Vk- If 5 = {vi}, then clearly when the algorithm has 
exhausted the half-edges of type B, it removed the subgraph C{vi) from the graph and all edges 
with one endpoint in C{vi). Then an exception is made by selecting a vertex say V2 in G\C{vi). 
Similarly when the algorithm exhausted the half-edges of type B, it removed the subgraph 
C{vi,V2) and all edges with one endpoint in this set of vertices. More generally, if k exceptions 
are made consisting of selecting nodes vi, . . .Vk, then before the k + 1-th exception is made (or 
at termination of the algorithm if there is no more exception made), the algorithm removed the 
subgraph C{vi, . . . , Vk) and all edges with one endpoint in this set of vertices. 

We use the same notation as in Section [4.31 In particular, we still have: 



supU(i) +B{t) -nXe 



-2t\ 



Op{n) 



(13) 
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We now ignore the effect of the exceptions by letting A{t) be the number of white A balls if no 
exceptions were made, i.e. assuming B{t) > for all t. If dmax = maxj di is the maximum degree 
of G*{n,d), then we have A{0) = A{0) G [n - dmax,"-]- By Condition [D (iii), dmax = 0(n^/2), 
and thus n~^(imax = o(n). Hence we can apply results of previous section: 



sup 



Ait) 



n 



rpstsihsr{e *)6s-r,j(l - vr) 



s,r+i>s—l 



(14) 



We now prove that: 



< A{t) - A{t) < sup {A{s) - A{s) - B{s)) + d„ 

0<s<t 



(15) 



The fact that A(t) > A{t) is clear. Furthermore, A(t) — A(t) increases only when exceptions are 
made. If an exception is made at time t, then the process B reached zero and a vertex with j 
white half-edges is selected so that B{t) = j — 1 < d^^x- Hence we have 

A{t) - A{t) < A{t) - A{t) - B{t) + dniax. 

Between exceptions, if A decreases by one then so does A, hence A{t) — A{t) does not increase. 
Consequently if s was the last time before t that an exception was performed, then A(t) — A(t) < 
A{s) - A{s) and follows. 

Let B{t) = A{t) + B{t) - A{t), then we have 



sup 

t>0 



n 



fie' 



0. 



Equation (llSp can be written as 



< A{t) - A{t) < - inf B{s) + dn 

s<t 



(16) 



(17) 



We first assume that ^ given by ^ is such that ^ > and there exists e > such that 
g{z) < and hence f{z) < for z e (^ - e,^). Let r = - In^. Then by ([T2|), we have /(e"*) > 
for < t < r so that inft<^ /(e~*) = /(I) = and hence by p^ . 

(18) 



min-^B{t) 4 0. 

t<T 

By Condition [1] (iii) , n~^(imax = o{n). Consequently, (fTTl) yields 

supn"^|^(t) - B{t)\ = supn-^|i(t) - A{t)\ 4 0, 

t<T t<T 



(19) 



and thus by (116 



sup 



Bit) 



n 



fie- 



0. 



(20) 



By assumption, there exists e > sufficiently small for /(e~'^~'^) < 0. Since /(e~*) > on 
the interval [e,r — e], (pOj) implies that w.h.p. B{t) remains positive on [e, r — e], and thus no 
exception is made during this interval. 
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On the other hand, /(e < and (fTUj) imphes n ^B{t + e) = /(e ^ + Op{n), while 
B{t + e) > 0. Thus with 5 = -/(e-^~^)/2 > 0, w.h.p. 

A{t + e) - A{t + e) = B{t + e) - B(t + e) > -^(r + e) > n5, (21) 

while (dlD yields A{t)-A{t) < nJ w.h.p. Consequently, w.h.p. A{T + e)-A{T+e) > A{t)-A{t) 
and an exception is performed between r and r + e. 

Let Ti be the last time an exception was performed before r/2 and let T2 be the next time 
it is performed. We have shown that for any e > 0, w.h.p. < Ti < e and t — e <T2 <r + e. 

Lemma 13. Let Tj* and T2 be two random times when an exception is performed, with < T2 , 
and assume that A- ti and Tg* A t2, where < ti < t2 < t. If v{T*) is the number of vertices 
removed by the algorithm by time T* , then 

v{T*)-v{T*) ^ ^22) 

In particular, if ti = t2, then ^(Tg*) — v(T^) = Op{n). 

Proof. By definition, we have: 

v{T^) - v{Tl) = Ai{Tt-) - Ai{T^-). 

Since Tg* t2 < t and / is continuous, inft<T2* ^i^U<t2 = 0) and (fT6|) and p!7|) 

imply, in analogy with (fT8|) and (fT9]l . infj<j'* B{t) A and 

sup n-i|i(t) - 4 0. (23) 



Let Ai{t) be the number of bins if no exceptions were made. Then we clearly have ^i(t) — 
Ai(t) < A{t) — A{t) since each time an exception is made Ai{t) — Ai{t) increases by one while 
A{t) — A{t) increases by more than one. Hence (|22p follows from results in previous section. □ 

Let C (resp. C") be the subgraph induced by the vertices removed by the algorithm between 
and Ti (resp. T2). By Lemma [T3l we have 

v{C')/n A (24) 
y^C")/n A /ii(l)-/ii(e) = 1-/11 (0. (25) 

Hence informally, the exception made at time Ti triggers a large cascade. 

Now consider the case = 0. Note in particular that we have /i(0) = and since hi{z) < h(z), 
we also have /ii(0) = 0. Then with the same argument as above, we have that B{t) remains 
positive on [e, +00) and thus no exception is made after a last exception made at time Ti with 
Ti 0. Hence (p^ and (|25]) follow with C" being the whole graph (except possibly Op{n) 
vertices), /ii(0) = 0. 

We now finish the proof of Theorem [TT] in the c ase wh ere ^ = or there exists e > such 



that g{z) < for z G (^ - e,0- First bv l.Tansonl . \20Q9fh Theorems 3.5 and 3.9, the cascade 
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condit ion imp l ies tha t v{P) = ilp(n). The result lim„ ^^^^ = 1 — /ii(^) > could be derived 
from (Janson, 2009al ). We give an alternative proof for this result at the end of this section. 



For now, we denote 7 = 1 — /ii(^) > 0. Coming back to the diffusion process analyzed above, 
we clearly have C Ci P = 0. We now prove that P C C". This is clear in the case ^ = 0. We 
now concentrate on the case > 0. First, let T3 be the first time after T2 when an exception is 
made. Since A{t) — A{t) increases by at most dmax = o{n) each time an exception is made, we 
obtain from (|23p : 

sup{A{t) - A{t)) < sup(i(t) - A{t)) + dmax = Op{n). 

t<T'i t<T2 



Hence similarly as in (j2ip . we have for every e > 0, w.h.p. r + e > T3. Since also T3 > T2 — r, 
it follows that T3 — >p r. If C" is the subgraph removed by the algorithm between T2 and 
T3, then Lemma [13] implies that v{C"') = Op{n). Assume now that C" r\ P = 0, then with 
probability at least 7 > 0, the vertex chosen for the exception at T2 belongs to P and then we 
have P(C"'has more than 771 vertices) > 7, in contradiction with v{C"') = Op{n). Hence we have 
C" n P / and then P C C" as claimed. 

We clearly have for any u & P: 

v{C{P)) = v{C{u)) = v{nuepC{u)). 

Hence we only need to prove that v{C{P)) > 1 — /ii(^)n + Op{n). To see this, attach to each 
vertex i a random variable Ui uniformly distributed over [0, 1]. Each time an exception has to 
be made, pick the vertex among the remaining ones with minimal Ui so that we do not change 
the algorithm described at the beginning of the section. From the analysis above, we see that 
all exceptions made before Ti are vertices not in P and the exception made at time Ti belongs 
to P. Now consider the graph G obtained from the original graph where C has been removed 
but all other variables are the same as in the original graph. Since v{C') = Op{n), this graph 
satisfies Conditions [T] and [2j Hence previous analysis applies and we have in addition that the 
first exception made by the algorithm belongs to P since a pivotal vertex in G is also pivotal in 
G. Hence the subgraph of G removed between times Ti = and T2 by the algorithm is exactly 
C{P) in G. Since G is a subgraph of G, we have C{P) C C{P) in the original graph. And the first 
claim in (i) follows from ()25p applied to the graph G. The second claim in (i) follows exactly as 
in the proof of Theorem [10] given in previous section. 

Now consider the case where > but for any e > 0, there exists z E — e, such that 
g{z) > and hence f{z) > 0. The idea to get a lower bound is to let vr vary. Since vr > 0, 
for any < vr' < vr, we see that by a standard coupling argument, for any given initial seed all 
active nodes in the model with vr' will also be active in the model with vr. Hence the model with 
vr' provides a lower bound for the number of active nodes in the model with vr. Now consider 
the function g(z, vr) = Xz(l — vr(l — z)) — a(l — vr(l — z)) as a function of vr. We have 

|^(z,vr) = (l-z) [a' {1- nil -z))-Xz] 
Since ^ is a local minimum of 2; 1— ?• g{z, vr) and g is differentiable as a function of z, we have 

^(e, ^) = 4^ a'(l - vr(l - e)) = -(1 - ^(1 - 0) + 
oz vr 

Hence we have ^(^, vr) = ^(1 — ,^)(1 — vr(l — ^)) > 0. In particular for vr — e < vr' < vr, we 
have 5(C,vr') < g{C,Tr) = 0. Let ^(vr') = sup{2; G [0,1), g{z,7r') = 0}, then we have ^(vr') > ^ 
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for any vr' < vr and g{z,TT') < for z € {(,,(,{Tr')). Moreover, we have ^(vr') — )• ^(vr) as vr' — > tt 
and previous argument is valid for the model with vr' as close as desired from vr showing that 
1 — /ii(^(7r')) — > 1 — hi{(^) is a lower bound for the fraction of active nodes in the model with vr. 

We finish this proof by computing the asymptotic fo r the size of P using our previous analysis 
but for a modified threshold as done in ( Lelarge . 20081 ). We add a bar for quantities associated 



to this new model. Namely, consider a modification of the original diffusion with threshold 
Ki{di) = {di + \)\{Ki[di) > 1). In words, a node i becomes active if one of its neighbor is 
active and Ki{di) = in the original diffusion model. Clearly the nodes that become active in 
this model need to have only one active neighbor in the original diffusion model with parameter 
Kj{dj). We denote by C{u) the subgraph of final active nodes with initial active node u. Note 
that our algorithm is an exploration process of the components of the graph on which we apply 
a bond percolation with parameter vr and a site percolation by removing all vertices with fe, > 1. 
In particular, the largest component explored is exactly P. Note that the computations made 
before are valid with the following functions: 

g[z) = {I - TT + ■KZ)\^Z -'^SPsil -tso) -'^SPstsoil - + T^zy~''^ 

hi{z) = y^pg(l - tso) + y^y>stso(l - vr + vrz)^. 

s s 

Hence we have ^ = supjz € [0, 1], g{z) = 0}. Note that if we denote (t){z) = g{z){l — vr + vrz)"^, 
then we have 

nzY-\ 



(j)'{z) = A - vr g(g - l)pstso(l - vr + 

s 

4>"{z) = -vr2^s(s-l)(s-2)p,t,o(l-vr + 



vr^r^^. 



In particular, cp is concave on (0, 1] and strictly concave unless ps = for s > 3. Note also that 
= A — vr^^ s(s — l)pstso, so that under the cascade condition </>'(!) < and (j) is strictly 
concave. Hence, we have ^ < 1 and > for x G (^, 1) and if ^ > 0, then < for x < ^. 
In particular, previous analysis allows to conclude that lim„ = 1 — hi{^) > 0. 



6 Proof of Propositions [6] and [7] 

We start with the proof of Proposition [71 By Theorem [TT| when the cascade condition holds, 
the set of active vertices contain the set of pivotal vertices and hence has a giant component. 
We denote Z the induced subgraph of inactive vertices, in the pivotal equilibrium (i.e. in the 
final state when all pivotal nodes are initially active) . By Theorem [TT| we have for Vr (1) , the 
number of vertices in I with degree r in Z: 



V (Z^ I 

^ ^ Psfesr(g), where, g = g(A) = max < z < 1, Az^ = y^ps ^ rb, 

r>s—[qs] I s ^^s—lqs] 



sr 



Z 
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We denote Vr{z) = Ylr>s-lqs\ Psbsr{z). Thanks to the result on the distribution of I, there is a 
giant component of inactive vertices if 

r r s r>s-lqs\ 

which can be rewritten as in Proposition [71 

We now prove Proposition [6l We assume that Pr = tt = 1 and K{d) = lqd\. The 

function ip{X) = e^^ Sr<(j-i (^-2)! increasing for A < A* and then decreasing. We assume that 
q is fixed such that the cascade condition holds. Then > 1- Then Aj(g') = sup{V'(A) < 

1,A < A*} while As(g) = inf{'0(A) > 1, A > A*}. We denote 



C(A) = max < 



z<l, Az^ = ^Ps ^ r{r-\)hsr{z) 

s i'~>^s—\cis\ 



Then, there is a giant component of inactive vertices if C,{}^ < '^(A). Both functions are non- 
increasing in A € {Xi{q), Xs{q)) and are intersecting only once in {\i{q), Xs{q)). 



7 Conclusions 

This paper analyzes the spread of new behaviors or technologies in large social networks. Our 
analysis is motivated by the two qualitative features of global cascades in social and economics 
systems: they occur rarely, but are large when they do. 

In our simplest model, agents play a local interaction binary game where the underlying so- 
cial network is modeled by a sparse random graph. Considering the deterministic best response 
dynamics, we compute the contagion threshold for this model. We find that when the social net- 
work is sufficiently sparse, the contagion is limited by the low connectivity of the network; when 
it is sufficiently dense, the contagion is limited by the stability of the high-degree nodes. This 
phenomenon explains why contagion is possible only in a given range of the global connectivity. 

Most importantly, we identify the set of agents able to trigger a large cascade: the pivotal 
players, i.e. the largest component of players requiring a single neighbor to change strategy in 
order to follow the change. The notion of pivotal player is crucial in the context of random 
graphs. Cascades occur only when pivotal players represent a positive fraction of the population 
and in this case, any cascade will be triggered by such a pivotal player. We found that these 
pivotal players exist only if the connectivity of the network is in a given range. At both ends of 
this range (i.e. for low and high-connectivity), the number of pivotal players is low. However 
in the high-connectivity case, we found that the system displays a robust-yet-fragile quality: 
while pivotal players are very rare, they trigger very large cascades. This feature makes global 
contagions exceptionally hard to anticipate. We also analyze possible equilibria of the game and 
in particular, we find conditions for the existence of equilibria with co-existent conventions. 

Motivated by social advertising, we also consider cases where the number of pivotal players 
is a negligible fraction of the population. In this case, contagion is still possible if the set of 
initial adopters is sufficiently large. We compute the final size of the contagion as a function of 
the fraction of the initial adopters. We find that the low and high-connectivity cases still have 
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different features: in the first case, tlie global connectivity helps the spread of the contagion 
while in the second case, high connectivity inhibits the global contagion but once it occurs, it 
facilitates its spread. 

Finally, we analyze a general percolated threshold model for the diffusion allowing to give 
different weights to the (anonymous) neighbors. Our general analysis gives explicit formulas for 
the spread of the diffusion in terms of the initial condition, the degree sequence of the random 
graph, and the distribution of the thresholds. It is hoped that this analysis will stimulate 
applications of our res ults to other pract i cal ca ses. One such possibi l ity is the study of financial 
networks as started in Gai and Kapadia ( 2O10l ) and Battiston et al. ( 20091 ). 
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A Appendix 

A.l Technical lemma 

Lemma 14. For any x, tt G [0, 1], and k >0 we have 



573-590. 




36 



Proof. This follows from the following observations for x > 0: 



V- - TT + xvr) = V 6s_i,^(l-7r(l-x)) 

^— ' 1 — TT + XTT 

r>k r>k—l 



S - 1) ^ &s-i,r(vr(l - x)) 

r<s—k 
s-1 

s-l)¥{Y^B,Yi<s-k), (26) 



1=1 



where the -Bj's and l^'s are independent Bernoulli random variables with parameter vr and 1 — x 
respectively. Now we also have: 



X 

r+i>k r+i>k~l 



-bs,r(.x)bs-r,i{l - T^) = {s - I) ^ bs-l,r{x)bs-l-r,i{l - T^) 

r+i>k~l 

s-1 s-1 

{s-l)¥{Y,{l-Yi) + Y,Yii^-B^)>k-l), 



1=1 1=1 

which corresponds exactly to (p6]) . □ 
A. 2 Properties of the functions /i and 

We now justify the use of the max in (0) . Let o'{d) be a Bernoulli random variable with parameter 
ad- Let for < x < 1, be the thinning of D obtained by taking D points and then randomly 
and independently keeping each of them with probability x. Thus given D = d, ~ Bi((i, x). 
With these notations, we have 

h{z;p,t,(X,7r) = E[Di_^+^,{l-a{D))l{Di^^+^,>D-K{D))] 
hi{z;p,t,cx,Tr) = F{a{D) = 0, Di^^+^,>D-K{D)), 

so that both h and hi are non-decreasing in z and non-increasing in vr. 

Note that if ct = 0, then /i(l; p, t, 0, vr) = A so that g{l;p,t,0,7r) = and z = I. We 
now consider the case a 7^ 0, so that there exists d > 1 such that Od > 0. In this case, we 
have g{l; a,p,-ir) = A — ^^^^(l — as)ps > adPdd > 0. The statement then follows from the 
fact that the only possible jumps for z 1— )• g(z;p,t, ct,7r) are downwards. More precisely, let 
z = sup{z G [0,1], g{z;p,t, a, tt) = 0}. Since the function h is non-decreasing in z, its set of 
discontinuity points is denumerable say {zjjjgN and h admits a left and right limit at any point 
z denoted by h{z—) and h{z+) respectively. If i G {zi}, then we have h{z—) < Az(l — vr -|-vri) < 
h{z+). In particular, we have g{z;p,t,a.,7r) < for any z > z which contradicts the fact that 
g{l;p, t, OL, vr) > 0. Hence the functions h and g are continuous at z and the sup is attained and 
can be replaced by a max. 
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A. 3 Proof of Lemma O 



Taking expectation in we get with x = E[y], 

1-x = ^p:il-as+i)¥lj2BiY,<q{s + l) 



s>0 



(l-Qi)y + 2^ 2^ bsjixir) 

s>l j<q{s+l) 



1 



(1 - ai)pi + ^ {I - as+i)ps+i{s + l)bsj{l 



Note that (s + l)6sj(l — xvr) = /J^"^^ ^s+ij+i(l — xvr) for s > 1, so that 



1 — X 



A(l - xvr) 



1 — xvr 



(l-ai)pi+ (1 - Qs+i)ps+i&s+ij+i(l - 2;7r) 

, , 1 — xvr 



s>i;i>s-L<?«J 



Hence, we get 



(1 - X7r)(l - x)A = ^ j(l - as)psfesi(l 



xvr 



and the first part of the lemma fohows. Taking expectation in ^ gives: E[X0] = ^ — J2s-j>s-e(^~ 
as)pstsibsji^ — xtt), and the second part of the lemma follows. 
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